IARG-AnCora: Anotación de los corpus AnCora con argumentos implícitos
نویسندگان
چکیده
Iarg-AnCora aims to annotate the implicit arguments of deverbal nominalizations in AnCora corpus. This corpus will be the basis for systems of automatic semantic role labeling based on machine learning techniques. Semantic analyzers are essential components in the current applications of language technologies, in which it is important to obtain a deeper understanding of the text to make inferences on the highest level in order to obtain qualitative improvements in the results.
منابع مشابه
Hacia una anotación de dependencias enriquecida de corpus españoles
We present a cost-effective strategy for the creation of a mid-size fine-grained Spanish dependency tree bank of surface-, deep-syntactic and semantic structures as defined in the Meaning-Text Theory. The strategy starts from a small seed dependency corpus, the AnCora corpus, whose annotation is considerably more coarse-grained than our target annotation. We show that this discrepancy can be br...
متن کاملAprendizaje de argumentos verbales completos y su plausibilidad en oraciones a partir de corpus
Resumen. El aprendizaje de preferencias de argumentos de verbos usualmente se ha tratado como un problema de verbo y argumento, o a lo mucho como una relación trinaria entre sujeto, verbo y objeto. Sin embargo, la correlación simultánea de todos los argumentos en una oración no ha sido explorado a profundidad para la medida de plausibilidad de una oración debido al alto número de combinaciones ...
متن کاملFrom constituents to syntax-oriented dependencies De constituyentes a dependencias de base sintáctica
This paper describes the automatic process of building a dependency annotated corpus based on Ancora constituent structures. The Ancora corpus already has a dependency structure information layer, but the new annotated data applies a purely syntactic orientation and offers in this way a new resource to the linguistic research community. The paper details the process of reannotating the corpus, ...
متن کاملAnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora
In this paper we present two large-scale verbal lexicons, AnCora-Verb-Ca for Catalan and AnCora-Verb-Es for Spanish, which are the basis for the semantic annotation with arguments and thematic roles of AnCora corpora. In AnCora-Verb lexicons, the mapping between syntactic functions, arguments and thematic roles of each verbal predicate it is established taking into account the verbal semantic c...
متن کاملAnCora: Multilevel Annotated Corpora for Catalan and Spanish
This paper presents AnCora, a multilingual corpus annotated at different linguistic levels consisting of 500,000 words in Catalan (AnCora-Ca) and in Spanish (AnCora-Es). At present AnCora is the largest multilayer annotated corpus of these languages freely available from http://clic.ub.edu/ancora. The two corpora consist mainly of newspaper texts annotated at different levels of linguistic desc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Procesamiento del Lenguaje Natural
دوره 49 شماره
صفحات -
تاریخ انتشار 2012